Search CORE

2,991 research outputs found

Learning Visual Reasoning Without Strong Priors

Author: Courville Aaron
de Vries Harm
Dumoulin Vincent
Perez Ethan
Strub Florian
Publication venue
Publication date: 11/08/2017
Field of study

Achieving artificial visual reasoning - the ability to answer image-related questions which require a multi-step, high-level process - is an important step towards artificial general intelligence. This multi-modal task requires learning a question-dependent, structured reasoning process over images from language. Standard deep learning approaches tend to exploit biases in the data rather than learn this underlying structure, while leading methods learn to visually reason successfully but are hand-crafted for reasoning. We show that a general-purpose, Conditional Batch Normalization approach achieves state-of-the-art results on the CLEVR Visual Reasoning benchmark with a 2.4% error rate. We outperform the next best end-to-end method (4.5%) and even methods that use extra supervision (3.1%). We probe our model to shed light on how it reasons, showing it has learned a question-dependent, multi-step process. Previous work has operated under the assumption that visual reasoning calls for a specialized architecture, but we show that a general architecture with proper conditioning can learn to visually reason effectively.Comment: Full AAAI 2018 paper is at arXiv:1709.07871. Presented at ICML 2017's Machine Learning in Speech and Language Processing Workshop. Code is at http://github.com/ethanjperez/fil

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

FiLM: Visual Reasoning with a General Conditioning Layer

Author: Courville Aaron
de Vries Harm
Dumoulin Vincent
Perez Ethan
Strub Florian
Publication venue
Publication date: 18/12/2017
Field of study

We introduce a general-purpose conditioning method for neural networks called FiLM: Feature-wise Linear Modulation. FiLM layers influence neural network computation via a simple, feature-wise affine transformation based on conditioning information. We show that FiLM layers are highly effective for visual reasoning - answering image-related questions which require a multi-step, high-level process - a task which has proven difficult for standard deep learning methods that do not explicitly model reasoning. Specifically, we show on visual reasoning tasks that FiLM layers 1) halve state-of-the-art error for the CLEVR benchmark, 2) modulate features in a coherent manner, 3) are robust to ablations and architectural modifications, and 4) generalize well to challenging, new data from few examples or even zero-shot.Comment: AAAI 2018. Code available at http://github.com/ethanjperez/film . Extends arXiv:1707.0301

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

HoME: a Household Multimodal Environment

Author: Anand Ankesh
Brodeur Simon
Celotti Luca
Courville Aaron
Golemo Florian
Larochelle Hugo
Perez Ethan
Rouat Jean
Strub Florian
Publication venue
Publication date: 29/11/2017
Field of study

We introduce HoME: a Household Multimodal Environment for artificial agents to learn from vision, audio, semantics, physics, and interaction with objects and other agents, all within a realistic context. HoME integrates over 45,000 diverse 3D house layouts based on the SUNCG dataset, a scale which may facilitate learning, generalization, and transfer. HoME is an open-source, OpenAI Gym-compatible platform extensible to tasks in reinforcement learning, language grounding, sound-based navigation, robotics, multi-agent learning, and more. We hope HoME better enables artificial agents to learn as humans do: in an interactive, multimodal, and richly contextualized setting.Comment: Presented at NIPS 2017's Visually-Grounded Interaction and Language Worksho

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Actionable Intelligence-Oriented Cyber Threat Modeling Framework

Author: Cameron Lance
Elkins Aaron
Larson Lance
Perez Marc
Shin Bongsik
Publication venue: 'HICSS Conference Office'
Publication date: 03/01/2022
Field of study

Amid the growing challenges of cybersecurity, the new paradigm of cyber threat intelligence (or CTI) has gained momentum to better deal with cyber threats. There, however, has been one fundamental and very practical problem of information overload organizations face in constructing an effective CTI program. We developed a cyber threat intelligence prototype that automatically and dynamically performs the correlation of business assets, vulnerabilities, and cyber threat information in a scoped setting to remediate the challenge of information overload. Conveniently called TIME (for Threat Intelligence Modeling Environment), it repeats the cycle of: (1) collect internal asset data; (2) gather vulnerability and threat data; (3) correlate vulnerabilities with assets; and (4) derive CTI and alerts significant internal asset-related vulnerabilities in a timely manner. For this, it takes advantage of CTI reports produced by online sites and several NIST standards intended to formalize vulnerability and threat management

ScholarSpace at University of Hawai'i at Manoa

AIS Electronic Library (AISeL)

Recommended from our members

Advancing Immigrant Incorporation in Austin, TX

Author: Escajeda Aaron
McConnell Micaela
Perez Ana
Uruchima Tania
Wasem Ruth
Publication venue
Publication date: 01/01/2021
Field of study

Austin is one of the fastest growing cities in the United States and is identified as an emerging gateway for immigrants. The single largest source country for immigrants to Austin continues to be Mexico, but immigrants from Asia are increasing in numbers and relative proportion. Immigrants from Africa doubled over the past decade and make up 4 percent of the foreign-born population. In other words, Austin’s foreign-born residents are increasingly diverse. This report serves to inform the City of Austin as it advances its immigrant incorporation efforts and focuses on five policy areas that the City of Austin can advance: 1) leadership and governance; 2) civic engagement and inclusivity; 3) economic prosperity and job growth; 4) livability; and 5) community resilience.LBJ School of Public Affair

Texas ScholarWorks

La influencia de la improvisación musical sobre las habilidades técnico-interpretativas en violinistas de formación clásica. Recomendaciones de materiales para los estudiantes de violín de la Especialidad de Música de la Pontificia Universidad Católica del Perú

Author: Perez Albela Carranza Gabriel Aaron
Publication venue: 'Baishideng Publishing Group Inc.'
Publication date: 05/09/2023
Field of study

La improvisación musical es la creación espontánea de una línea melódica bajo un contexto armónico-rítmico determinado. La práctica de la improvisación desarrolla habilidades relacionadas con la creatividad, la espontaneidad y el oído armónico. A pesar de estos beneficios y según los resultados de la encuesta realizada, improvisar no forma parte de la práctica regular de los violinistas de la Especialidad de Música de la Pontificia Universidad Católica del Perú. Por ese motivo, la presente investigación tiene como objetivo general identificar de qué manera la improvisación musical influye sobre las habilidades técnicointerpretativas en violinistas de formación clásica. Además, se incluye una propuesta de materiales que permita introducir la improvisación en la formación de los estudiantes de violín de la Especialidad de Música de la Pontificia Universidad Católica del Perú. En la metodología se presenta un enfoque cualitativo descriptivo, a partir de la revisión bibliográfica y la entrevista semiestructurada a dos maestros violinistas improvisadores: Kostia Lukyniuk y Joshue Ashby. Además, se aplicó una encuesta a los violinistas de la Especialidad de Música de la PUCP, que permitió diseñar una propuesta de materiales de acuerdo con sus intereses y necesidades. Finalmente se realizó una selección de materiales ordenados para el acercamiento a la improvisación en el violín, que fueron ejecutados y analizados a través de sesiones de práctica auto-observada y documentada para sustentar la propuesta en la experiencia personal. Los resultados muestran que la improvisación beneficia al desarrollo de habilidades técnico-interpretativas relacionadas, fundamentalmente, con la producción del sonido, la ejecución de base armónica, los golpes de arco, el dominio de la afinación, el reconocimiento del diapasón, la espontaneidad y la creatividad

Repositorio Digital de Tesis PUCP